Picture for Xiaosong Jia

Xiaosong Jia

Resolving Representation Ambiguity in Feedforward Novel View Synthesis Transformer via Semantic-Spatial Decoupling

Add code
May 18, 2026
Viaarxiv icon

GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization

Add code
May 12, 2026
Viaarxiv icon

SWIFT: Prompt-Adaptive Memory for Efficient Interactive Long Video Generation

Add code
May 10, 2026
Viaarxiv icon

Attention Itself Could Retrieve.RetrieveVGGT: Training-Free Long Context Streaming 3D Reconstruction via Query-Key Similarity Retrieval

Add code
May 10, 2026
Viaarxiv icon

Bench2Drive-VL: Benchmarks for Closed-Loop Autonomous Driving with Vision-Language Models

Add code
Apr 01, 2026
Viaarxiv icon

Can Users Specify Driving Speed? Bench2Drive-Speed: Benchmark and Baselines for Desired-Speed Conditioned Autonomous Driving

Add code
Mar 26, 2026
Viaarxiv icon

ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments

Add code
Mar 03, 2026
Viaarxiv icon

PointAlign: Feature-Level Alignment Regularization for 3D Vision-Language Models

Add code
Feb 28, 2026
Viaarxiv icon

Efficient-LVSM: Faster, Cheaper, and Better Large View Synthesis Model via Decoupled Co-Refinement Attention

Add code
Feb 06, 2026
Viaarxiv icon

Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank

Add code
Dec 13, 2025
Figure 1 for Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Figure 2 for Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Figure 3 for Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Figure 4 for Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Viaarxiv icon